Search results for: Eng Siong Chng

Items from 1 to 6 out of 6 results

chapter

Vulnerability of speaker verification systems against voice conversion spoofing attacks: The case of telephone speech

Tomi Kinnunen, Zhi-Zheng Wu, Kong Aik Lee, Filip Sedlak, more

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4401 - 4404

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

Voice conversion - the methodology of automatically converting one's utterances to sound as if spoken by another speaker - presents a threat for applications relying on speaker verification. We study vulnerability of text-independent speaker verification systems against voice conversion attacks using telephone speech. We implemented a voice conversion systems with two types of features and nonparallel...

chapter

Improved Keypoint Matching Method for Near-Duplicate Keyframe Retrieval

Ehsan Younessian, D. Rajan, Eng Siong Chng

2009 11th IEEE International Symposium on Multimedia > 298 - 303

2009 11th IEEE International Symposium on Multimedia (ISM 2009)

We propose a Near-Duplicate Keyframe (NDK) retrieval method that can handle extreme zooming and significant object motion. The first stage consists of eliminating false keypoint matches using symmetric property and a ratio of nearest and second-nearest neighbor distances. Then, a pattern coherency score is assigned to each pair of keyframes. These two features are combined through linear discriminant...

chapter

Subspace construction and selection for speaker recognition

Yanhua Long, Wu Guo, Bin Ma, Eng Siong Chng, more

2009 7th International Conference on Information, Communications and Signal Processing (ICICS) > 1 - 4

2009 7th International Conference on Information, Communications & Signal Processing (ICICS)

In this paper, we propose a subspace construction and selection strategy (SUBS) for speaker recognition with limited training and testing speech data. Based on the individual Gaussian distributions of Gaussian mixture model (GMM), each speaker's characteristic subspace is constructed by training an SVM using the corresponding Gaussian mean vectors from the GMMs of both enrollment and imposter speakers...

chapter

Exploiting prosodic information for Speaker Recognition

Yanhua Long, Bin Ma, Haizhou Li, Wu Guo, more

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 4225 - 4228

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

In this paper, we study speaker characterization using prosodic supervectors with negative within-class covariance normalization (NWCCN) projection and speaker modeling with support vector regression (SVR). We also propose a segmental weight fusion (SWF) technique that combines acoustic and prosodic subsystems effectively, despite the big performance gap between the subsystems. We validate the effectiveness...

chapter

The I4U system in NIST 2008 speaker recognition evaluation

Haizhou Li, Bin Ma, Kong-Aik Lee, Hanwu Sun, more

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 4201 - 4204

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

This paper describes the performance of the I4U speaker recognition system in the NIST 2008 Speaker Recognition Evaluation. The system consists of seven subsystems, each with different cepstral features and classifiers. We describe the I4U Primary system and report on its core test results as they were submitted, which were among the best-performing submissions. The I4U effort was led by the Institute...

chapter

Discriminative Output Coding Features for Speech Recognition

O. Dehzangi, Bin Ma, Eng Siong Chng, Haizhou Li

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

2008 6th International Symposium on Chinese Spoken Language Processing

This paper presents a novel approach of discriminative acoustic feature extraction for speech recognition using output coding technique. A high dimensional feature space for higher discriminative capability is constructed by expanding MFCC coefficients with polynomial expansion. In order to fit the discriminative features in the hidden Markov model structure of speech recognition, the high dimensional...

Filter options

Keywords:
SUPPORT VECTOR MACHINES

Publication date

Set your own date range

Content availability

Available (5)
None (1)

Keywords

SPEECH (4)
FEATURE EXTRACTION (3)
NIST (3)
SPEAKER RECOGNITION (3)
TRAINING (3)
ACOUSTICS (2)
DATA MINING (2)
SUPPORT VECTOR MACHINE (2)
VECTORS (2)
ADAPTATION MODEL (1)
ADAPTATION MODELS (1)
CEPSTRAL FEATURES (1)
CHANNEL VARIABILITY (1)
CLASSIFIER (1)
DISCRIMINATIVE ACOUSTIC FEATURE EXTRACTION (1)
DISCRIMINATIVE OUTPUT CODING FEATURES (1)
ERROR RATE REDUCTION (1)
EXTREME ZOOMING (1)
FEATURE WEIGHTING ROLE (1)
GAUSSIAN DISTRIBUTION (1)
GAUSSIAN MEAN VECTORS (1)
GAUSSIAN MIXTURE MODEL (1)
GMM SUPERVECTOR BASELINE SYSTEM (1)
HIDDEN MARKOV MODEL STRUCTURE (1)
HIDDEN MARKOV MODELS (1)
I4U SYSTEM (1)
IMAGE COLOR ANALYSIS (1)
IMAGE MATCHING (1)
IMAGE MOTION ANALYSIS (1)
INDIVIDUAL GAUSSIAN DISTRIBUTIONS (1)
INTERVIEWS (1)
JOINTS (1)
KERNEL (1)
KEYPOINT MATCHING (1)
LINEAR DISCRIMINANT ANALYSIS (1)
MEL FREQUENCY CEPSTRAL COEFFICIENT (1)
MICROPHONES (1)
NEAR-DUPLICATE KEYFRAME (1)
NEAR-DUPLICATE KEYFRAME RETRIEVAL (1)
NEGATIVE WITHIN-CLASS COVARIANCE NORMALIZATION (1)
NIST SPEAKER RECOGNITION EVALUATIONS (1)
OBJECT MOTION (1)
PATTERN COHERENCY SCORE (1)
PATTERN MATCHING (1)
POLYNOMIAL EXPANSION (1)
REGRESSION ANALYSIS (1)
ROBUSTNESS (1)
SECOND-NEAREST NEIGHBOR DISTANCE (1)
SECURITY (1)
SEGMENTAL WEIGHT FUSION (1)
SIFT KEYPOINTS (1)
SPEAKER CHARACTERISTIC SUBSPACE (1)
SPEAKER MODELING (1)
SPEAKER VERIFICATION (1)
SPEECH CODING (1)
SPEECH DATA TESTING (1)
SPEECH RECOGNITION (1)
STRUCTURE RISK CRITERION (1)
SUBSPACE CONSTRUCTION AND SELECTION STRATEGY (1)
SUPPORT VECTOR MACHINE CLASSIFICATION (1)
SUPPORT VECTOR REGRESSION (1)
SVM (1)
SVM TRAINING (1)
SYMMETRIC PROPERTY (1)
SYSTEM FUSION (1)
TRAINING DATA (1)
VIDEO SIGNAL PROCESSING (1)
VISUALIZATION (1)
VOICE CONVERSION (1)
more

INFONA - science communication portal

Search results for: Eng Siong Chng

Vulnerability of speaker verification systems against voice conversion spoofing attacks: The case of telephone speech

Improved Keypoint Matching Method for Near-Duplicate Keyframe Retrieval

Subspace construction and selection for speaker recognition

Exploiting prosodic information for Speaker Recognition

The I4U system in NIST 2008 speaker recognition evaluation

Discriminative Output Coding Features for Speech Recognition

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options